智能论文笔记

Multi-objective Optimization of Notifications Using Offline Reinforcement Learning

Prakruthi Prabhakar , Yiping Yuan , Guangyu Yang , Wensheng Sun , Ajith Muralidharan

分类：机器学习 | (统计)机器学习

2022-07-07

移动通知系统在各种应用程序中起着重要作用，以通信，向用户发送警报和提醒，以告知他们有关新闻，事件或消息的信息。在本文中，我们将近实时的通知决策问题制定为马尔可夫决策过程，在该过程中，我们对奖励中的多个目标进行了优化。我们提出了一个端到端的离线增强学习框架，以优化顺序通知决策。我们使用基于保守的Q学习的双重Q网络方法来应对离线学习的挑战，从而减轻了分配转移问题和Q值高估。我们说明了完全部署的系统，并通过离线和在线实验证明了拟议方法的性能和好处。

translated by 谷歌翻译

Scale-Invariant Specifications for \\Human-Swarm Systems

Joel Meyer , Ahalya Prabhakar , Allison Pinosky , Ian Abraham , Annalisa Taylor , Millicent Schlafly , Katarina Popovic , Giovani Diniz , Brendan Teich , Borislava Simidchieva

分类：机器人

2022-12-06

We present a method for controlling a swarm using its spectral decomposition -- that is, by describing the set of trajectories of a swarm in terms of a spatial distribution throughout the operational domain -- guaranteeing scale invariance with respect to the number of agents both for computation and for the operator tasked with controlling the swarm. We use ergodic control, decentralized across the network, for implementation. In the DARPA OFFSET program field setting, we test this interface design for the operator using the STOMP interface -- the same interface used by Raytheon BBN throughout the duration of the OFFSET program. In these tests, we demonstrate that our approach is scale-invariant -- the user specification does not depend on the number of agents; it is persistent -- the specification remains active until the user specifies a new command; and it is real-time -- the user can interact with and interrupt the swarm at any time. Moreover, we show that the spectral/ergodic specification of swarm behavior degrades gracefully as the number of agents goes down, enabling the operator to maintain the same approach as agents become disabled or are added to the network. We demonstrate the scale-invariance and dynamic response of our system in a field relevant simulator on a variety of tactical scenarios with up to 50 agents. We also demonstrate the dynamic response of our system in the field with a smaller team of agents. Lastly, we make the code for our system available.

translated by 谷歌翻译

A Domain-specific Perceptual Metric via Contrastive Self-supervised Representation: Applications on Natural and Medical Images

Hongwei Bran Li , Chinmay Prabhakar , Suprosanna Shit , Johannes Paetzold , Tamaz Amiranashvili , Jianguo Zhang , Daniel Rueckert , Juan Eugenio Iglesias , Benedikt Wiestler , Bjoern Menze

分类：计算机视觉

2022-12-03

Quantifying the perceptual similarity of two images is a long-standing problem in low-level computer vision. The natural image domain commonly relies on supervised learning, e.g., a pre-trained VGG, to obtain a latent representation. However, due to domain shift, pre-trained models from the natural image domain might not apply to other image domains, such as medical imaging. Notably, in medical imaging, evaluating the perceptual similarity is exclusively performed by specialists trained extensively in diverse medical fields. Thus, medical imaging remains devoid of task-specific, objective perceptual measures. This work answers the question: Is it necessary to rely on supervised learning to obtain an effective representation that could measure perceptual similarity, or is self-supervision sufficient? To understand whether recent contrastive self-supervised representation (CSR) may come to the rescue, we start with natural images and systematically evaluate CSR as a metric across numerous contemporary architectures and tasks and compare them with existing methods. We find that in the natural image domain, CSR behaves on par with the supervised one on several perceptual tests as a metric, and in the medical domain, CSR better quantifies perceptual similarity concerning the experts' ratings. We also demonstrate that CSR can significantly improve image quality in two image synthesis tasks. Finally, our extensive results suggest that perceptuality is an emergent property of CSR, which can be adapted to many image domains without requiring annotations.

translated by 谷歌翻译

Hybrid-SD (H_SD): A new hybrid evaluation metric for automatic speech recognition tasks

Zitha Sasindran , Harsha Yelchuri , Supreeth Rao , T. V. Prabhakar

分类：自然语言处理

2022-11-03

Many studies have examined the shortcomings of word error rate (WER) as an evaluation metric for automatic speech recognition (ASR) systems, particularly when used for spoken language understanding tasks such as intent recognition and dialogue systems. In this paper, we propose Hybrid-SD (H_SD), a new hybrid evaluation metric for ASR systems that takes into account both semantic correctness and error rate. To generate sentence dissimilarity scores (SD), we built a fast and lightweight SNanoBERT model using distillation techniques. Our experiments show that the SNanoBERT model is 25.9x smaller and 38.8x faster than SRoBERTa while achieving comparable results on well-known benchmarks. Hence, making it suitable for deploying with ASR models on edge devices. We also show that H_SD correlates more strongly with downstream tasks such as intent recognition and named-entity recognition (NER).

translated by 谷歌翻译

Physics-Constrained Neural Network for the Analysis and Feature-Based Optimization of Woven Composites

Haotian Feng , Sabarinathan P Subramaniyan , Pavana Prabhakar

分类：机器学习

2022-09-19

编织的复合材料是通过隔板和纬纱以图案或编织方式进行的。通过更改图案或材料，可以显着改变编织复合材料的机械性能。但是，尚不清楚编织复合体系结构（图案，材料）在机械性能上的作用。在本文中，我们通过我们提出的物理受限的神经网络（PCNN）探讨了编织复合体系结构（编织模式，编织材料序列）与相应模量之间的关系。此外，我们采用统计学习方法来优化编织复合体系结构以改善机械响应。我们的结果表明，PCNN可以有效地预测所需模量的编织体系结构，其精度比几种基线模型高得多。 PCNN可以与基于功能的优化相结合，以确定初始设计阶段的最佳编织复合体系结构。除了将编织复合体系结构与其机械响应联系起来外，我们的研究还提供了对建筑特征如何控制机械响应的深入了解。我们预计我们提出的框架将主要促进编织的综合分析和优化过程，并成为将物理知识引导的神经网络引入复杂结构分析的起点。

translated by 谷歌翻译

Bayesian Statistical Model Checking for Multi-agent Systems using HyperPCTL*

Spandan Das , Pavithra Prabhakar

分类：人工智能 | 机器人

2022-09-06

在本文中，我们提出了一种用于在离散时间马尔可夫链（DTMC）上指定的概率超普通统计模型检查（SMC）的贝叶斯方法。尽管使用顺序概率比测试（SPRT）的HyperPCTL*的SMC曾经探索过，但我们基于贝叶斯假说检验开发了一种替代SMC算法。与PCTL*相比，由于它们在DTMC的多个路径上同时解释，验证HyperPCTL*公式是复杂的。此外，由于SMC无法返回Subformulae的满意度问题，因此扩展非稳定设置的自下而上的模型检查算法并不直接，相反，它仅通过高级返回正确的答案。信心。我们根据修改后的贝叶斯测试，提出了一种HyperPCTL* SMC的递归算法，该测试因递归满意度结果的不确定性而导致。我们已经在Python工具箱Hybrover中实现了算法，并将我们的方法与基于SPRT的SMC进行了比较。我们的实验评估表明，我们的贝叶斯SMC算法在验证时间和推断给定HyperPCTL*公式的满意度所需的样品数量方面的性能更好。

translated by 谷歌翻译

Robotic Untangling of Herbs and Salads with Parallel Grippers

Prabhakar Ray , Matthew Howard

分类：机器人

2022-08-09

从一个或多个未分类桩中挑选一个或多个物体对于机器人系统而言仍然是不平凡的。当桩由包含彼此纠缠的单个项目的颗粒材料（GM）组成时，尤其如此，导致挑选出更多的选择。这种容易发生的GM的关键特征之一是从桩中的主要物体延伸的突起存在。这项工作描述了后者在引起机械纠缠及其对选择一致性的影响方面所扮演的角色。 IT报告了实验，其中采摘具有不同突出长度（PLS）的GMS导致挑选质量差异增加了76％，这表明PL是采摘策略设计中的一项信息功能。此外，为了应对这种效果，它提出了一种新的传播（SNP）方法，可大大减少纠结，从而使选择更加一致。与试图从桩中的无缠结点进行选择的先前方法相比，提出的方法导致选择误差（PE）的降低高达51％，并显示出对先前看不见的GMS的良好概括。

translated by 谷歌翻译

Segmentation Guided Deep HDR Deghosting

K. Ram Prabhakar , Susmit Agrawal , R. Venkatesh Babu

分类：计算机视觉

2022-07-04

我们提出了一种运动分割引导的卷积神经网络（CNN）方法，以进行高动态范围（HDR）图像磁化。首先，我们使用CNN分段输入序列中的移动区域。然后，我们将静态区域和移动区域分别与不同的融合网络合并，并结合融合功能以生成最终的无幽灵HDR图像。我们的运动分割引导的HDR融合方法比现有的HDR脱胶方法具有显着优势。首先，通过将输入序列分割为静态和移动区域，我们提出的方法可以为各种具有挑战性的饱和度和运动类型学习有效的融合规则。其次，我们引入了一个新颖的存储网络，该网络积累了在饱和区域中生成合理细节所需的必要功能。所提出的方法在两个公开可用的数据集上优于九种现有的最新方法，并生成视觉上令人愉悦的无幽灵HDR结果。我们还提供了3683个不同暴露图像的大规模运动细分数据集，以使研究社区受益。

translated by 谷歌翻译

Spectrum Surveying: Active Radio Map Estimation with Autonomous UAVs

Raju Shrestha , Daniel Romero , Sundeep Prabhakar Chepuri

分类：机器学习

2022-01-11

无线电贴图在无线通信和移动机器人任务中找到了许多应用，包括资源分配，干扰协调和任务规划。尽管已经提出了许多技术来构造来自空间分布测量的无线电映射，但是预先假定了这种测量的位置的位置。相反，本文提出了频谱测量，其中诸如无人航空车辆（UAV）的移动机器人在主动选择的一组位置处收集测量以在短测量时间内获得高质量地图估计。这是以两步执行的。首先，设计了两种新颖的算法，基于模型的在线贝叶斯估计器和数据驱动的深度学习算法，以更新地图估计和指示每个可能位置的测量信息的信息性。这些算法提供互补的益处，并且每次测量都具有恒定的复杂性。其次，不确定度量用于规划无人机的轨迹，以在最具信息地的位置收集测量。为了克服这个问题的组合复杂性，提出了一种动态编程方法，以通过线性时间的大不确定性的区域获取航路点列表。在现实数据集上进行的数值实验证实了所提出的方案快速构建精确的无线电贴图。

translated by 谷歌翻译

Self-Gated Memory Recurrent Network for Efficient Scalable HDR Deghosting

K. Ram Prabhakar , Susmit Agrawal , R. Venkatesh Babu

分类：计算机视觉

2021-12-24

我们提出了一种新型的基于网络的基于网络的HDR Duthosting方法，用于融合任意长度的动态序列。所提出的方法使用卷积和经常性架构来产生视觉上令人愉悦的重影的HDR图像。我们介绍了一个新的反复间谍架构，即自动门控内存（SGM）单元格，这胜过标准LSTM单元格，同时包含更少的参数并具有更快的运行时间。在SGM小区中，通过将门的输出乘以自身的函数来控制通过门的信息流。此外，我们在双向设置中使用两个SGM单元来提高输出质量。该方法的方法与现有的HDR Deghosting方法定量跨三个公共数据集相比，实现了最先进的性能，同时同时实现熔断器可变长度输入顺序的可扩展性而不需要重新训练。通过广泛的消融，我们证明了各个组件以拟议方法的重要性。该代码可在https://val.cds.iisc.ac.in.in/hdr/hdrrn/index.html中获得。

translated by 谷歌翻译